
CRF Problem Report 

The Scientific and Technical Information Center (STIC) experienced a problem 
_when processing the following computer_readab.le.form (CRF): . 

Application Serial Number: 09y<o 79 9/3 (^PC / } 

Filing Date: // - 8^00 

Date Processed by STIC: >2- S'-Q O 

STIC Contact: Mark Spencer, 703-308-4212 ► 
Nature of Problem: 

The CRF (was): 

(circle one) Damaged or Unreadable (for Unreadable, see attached) 

Blank (no files on CRF) (see attached) 

Empty file (filename present, but no bytes in file) (see attached) 

Virus-infected. Virus name: The STIC will not process the CRF. 

Not saved in ASCII text 

Sequence Listing was embedded in the file. According to Sequence Rules, 

submitted file should only be the Sequence Listing. 

Did not contain a Sequence Listing, (see attached sample) 



y Other: /O^n- cory^/^c.^^ ^e^oe^^c -f^r^^-f ^ 



PLEASE USE THE CHECKER VERSION 3.0 PROGRAM TO REDUCE ERRORS. 
SEE BELOW FOR DETAILS: 

Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
comphance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST,25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Vprsion 3.0 can be down loaded from the USPTO website at the following address: 

http://ww>v.uspto.gov/web/offices/pac/checker 



SEQUENCE LISTING 

COMMON FOR ALL SEQUENCES. 
SEQUENCE TYPE: Peptide 
SEQUENCE UNIT ; Amino Acid 
TOPOLOGY: Linear 



^es Not Comply 
Corrected Diskette Needed 



y s 



SEQUENCE ID NO: 1 

SEQUENCE LENGTH: 19 amino acids 
NVPGHERMGRGRTSSKELA 
1 5 



10 



15 



SEQUENCE ID NO: 2 

SEQUENCE LENGTH: 27 amino acids 

RLEAKHRENVPGHERMGRGRTSSKELA 

15 10 15 20 

25 



SEQUENCE ID NO: 3 

SEQUENCE LENGTH: 17 amino acids 
RLEAKHRENVPGHERMG 

15 10 15 



SEQUENCE ID NO: 4 

SEQUENCE LENGTH: 12 amino acids 
MGRGRTSSKELA 

15 10 



SEQUENCE ID NO: 5 

SEQUENCE LENGTH: 15 amino acids 



Federal R egstej|pjg l,-63/ No. 104 /Monday, June l,^ 9\ 



Appendix A To Subpart G.to Part I— Sample Sequence Listing^ ^ , 
<110> Sodth, John 

Smith, Jane 




n 



<120> Example of a Sequence Listing 



<130> 01-00001 



<140> US Oe/999,999 



<141> 1998-02-28 



<1S0> EP 91000000 



<151> 1997-12-31 



:S2964'6i4- Federal Register >^p. 63. Nor 104 /M onday. June 1, 1998 J^es m&^^fjif^ 



<160> 2 



:170> Patentin ver. 2.0 



<210> 1 



<211> 403 



<212> DNA 



<213> Paramecium aurelia 



<220> 



<221> CDS 



<222> 341. .394 



<300> 



<301> Doe, Richard 



<302> Isolation and Characterization of _a Gene Encoding a 



Protease from Paramecium sp. 



<303> Journal of Fictional Genes 



Federal Register]^. 63. Uo;j0im^ 1. 1998^es.^i^j^^o 



<304> 1 ^ 



<305> 4 



<306-> 1-7 



<307> 1988-06-20 



<400> 1 



ctactctact ctactctcat ctactatctt ctttggatct ctgagtctgc ctgagtggta 60 



ctcttgagtc ctggagatct ctcctctcac atgtgatcgt cgagactgac cgatagatcg 120 



ctgactgact ctgagatagt cgagcccgta cgagacccgt cgagggtgac agagagtggg 180 



cgcgtgcgcg cagagcgccg cgccggtgcg cgcgcgagtg cgcggtgggc cgcgcgaggg 240 



ctttcgcggc agcggcggcg ctttccggcg cgcgcccgtc cgcccctaga octgagaggt 300 



4» O 



cttctcttcc ctcctcttca ccagagaggt ctatatatac atg gtt tea atg ttc 

Met Val Ser Met Phe 



355 



age ttg tct etc aaa tgg cct gga ttt tgt ttg ttt gtf'tgtt'tgctc 

I 

Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val 
10 IS - 



<210> 2 
<211> 18 
<212> PRT 

<213> Paramecium aurelia 
<400> 2 



<120> 

<130> 

<140> 

<141> 

<150> 

<151> ........v.. 

<160> 



<210> 
<211> 



r^r-Oefinition *.t 



rt\\eM\0r\an. 

Hie Reference 

CurrenTAppncatlon 
Number. ' 

Currenl F3ir>gpate 

Prior AppQca (ton Nurn-. 

bate. 
Nurnber of S^^^ 

SEQ ID N0:#: 

Length 



Commcnls and ^orrnf\^Vg^^y- - 



Preferabty max. of 10 names; one name/per- 
preferable lomial: Surname, Olhcr Names:; 
or Initials. 




Personal file reference 

Specify as: US 07/999.999 or PCTAJ 596/99999-^ 



Specify as: yyyy-mm-dd 

Specify.as: US 07/999.999 or PCT/US96/99999 

Specify as: yyyywnm-dd 

Count Includes total number of SEQ IDrNOs ....... 

Nartic of sbhwiire used to c/e'ate 'tl*B" Sequcf>c(B 
Listing. 

Response shall be an integer representing the 
- SECViO NO shown. 

Respond with an Integer expressing the number 
of bases or amino acid residues, 



M whefiTilcd prior to assignment of appl. num- 
ber. — . 

M iL5vai|§b!a--rf-<c: 
M, if available. 

M. if applicable include priority documents under 

35:useSil9:ahdvf20; 

M, If ap^icable. 

M. 



M. 
M. 



J!Iameric Iden- 
■ liHcr 



OeHnitjon 



Comments and. format 



Maf»5&h)ry |M) or optional (O). 



<212> 



<213> 



<220> 



<221> 



<222> 



<223> 



Type 



Organism 



Feature 



Name/Key 



Location 



Other Information 



<300> 
<301> 

<302> 
<303> 
<304> 
<305> 
<306> 
<307> 

<308>. 

<309> 

<310> 

<311> 

<312> 

<313> 
<400> 



Publication Infomiation 
Authors 

Title 

Journal 

Volume 

Issue 

Pages 

Date 

Database Accession 

Number. 
Database Entry Dale ,. 

Patent Document Num- 
ber. 

Patent Filing Dale 

Publication Date 

Relevant Residues 
Sequence 



Whether presented sequence molecule Is DNA. 
RNA, or PRT (protein). If. a nucleotide se- 
quer>ce contains both DNA and RNA frag- 
ments, the type shall be ''DNA." In addition, 
the combined DNA/RNA molecule shall be fur- 
ther described In the <220> to <223> feature 
section. 

Scientific name. I.e. Genus/ species. Unknown or 
Artifidal Segucnce. In addition, the "Unknown" 
tu "ArtiGcial Sequence" organisms shall be fur- 
' ther described In the <220> to <223> feature 
section 

Leave blank after <220>, <221-223> provide for 
a dcscdption of points of biological significance 
In the sequence.. 

Provide appropriate identifier lor (oature. pref- 
erably Irom WiPO Standard ST.25 (1998). A(> 
pcndix 2. Tables 5 and 6. 

Specify location within sequence; where appro- 
priate state number of first and last bases/ 
amino acids in feature. 

Other relevant information; four lines maximum ... 



Leave blank atter <300> ; 

Preferably max of ten named authors of publica- 
tion; specify one name per line; preferable for- 
mat: Surname. Other Names and/or Initials. 

Joumal date on which data published; specify as 
VTyy-mm-dd. MMM-yyyy or Season-yyyy. 

Accession number assigned by database indud* 
ing database name. 

Date of entry in database; specify as yyyy-mm- 
dd or MMM-yyyy, 

Document number; lor patent-type citations only. 
Specify as. lor example. US 07/999.999. 

Document filing date, for patent-type citations 
only; specify as yyyy-mm-dd. 

Document publication date, lor patent-type dta- 
lions only; spcdfy as yyyy-mm-dd. 

FROM (position) TO (posHion) 

SEQ ID NO should follow the numeric identifier 
and should appear on the line preceding the 
actual sequence. 



M. 



M 



M, under the lollowing conditions: if "n," -Xaa," 
or a modified or unusual L-amino add or modi- 
fied base was used in a sequence; if ORGA- 
NISM is "Artificial Sequence" or "Unknown*; K 
molecule is combinc(l Dlg/yRNA" 

M, under the lollowing conditions; if "n." "Xaa," 
or a modified or unusual L-amino add or modi- 
fied base was used in a sequence. 

M. under the following conditions: if "n.** "Xaa," 
or a modified or unusual L-amIno add or modi- 
fied base was used in a sequence. 

M. under the following conditions: if "n," *'Xaa.*' 
or a modified or unusual L-amino add or modi- 
fied base was used in a sequence; if ORGA- 
NISM is "Artifidal Sequence" or "Unknown"; if 
molecule is combined DNA/RNA. 

O. 
O. 



O. 
O. 
O. 
O. 
O. 
O. 

O. 

O. 

O. 

O. 

O. 

O. 
M. 



